Building Concept Frames based on Text Corpora
Abstract
Linguists have used various kinds of frame representation since the notion of the “frame” emerged. The main goal of the annotation system described in this paper is to provide an interactive, easy-to-use tool for structuring concept-specific information in linguistic frames for discourse analysis or cultural studies. These frames take into account the background or “world” knowledge associated with the concepts, which is not necessarily present in lexicographic frames. A single database combines a frame hierarchy that provides default information, example texts that contain specific information on a concept, and the annotations made by a user. All frames have a predefined structure, and the information they contain is represented in natural language. The collected information can also serve as input to knowledge bases or be used to define patterns for Information Extraction.
Similar resources
OPTIMUM PERFORMANCE-BASED DESIGN OF CONCENTRICALLY BRACED STEEL FRAMES SUBJECTED TO NEAR-FAULT GROUND MOTION EXCITATIONS
This paper presents a practical methodology for optimization of concentrically braced steel frames subjected to forward directivity near-fault ground motions, based on the concept of uniform deformation theory. This is performed by gradually shifting inefficient material from strong parts of the structure to the weak areas until a state of uniform deformation is achieved. In this regard, to ove...
A System for Building FrameNet-like Corpus for the Biomedical Domain
Semantic Role Labeling (SRL) plays an important role in different text mining tasks. The development of SRL systems for the biomedical area is frustrated by the lack of large-scale domain-specific corpora that are annotated with semantic roles. In our previous work, we proposed a method for building a FrameNet-like corpus for the area using domain knowledge provided by ontologies. In this paper,...
Arabic News Articles Classification Using Vectorized-Cosine Based on Seed Documents
Besides its own merits, text classification (TC) has become a cornerstone in many applications. The work presented here is part of, and a prerequisite for, a project we have undertaken to create a corpus for the Arabic text process. It is an attempt to create modules automatically that would help speed up the process of classification for any text categorization task. It also serves as a tool for...
Building a Bio-Event Annotated Corpus for the Acquisition of Semantic Frames from Biomedical Corpora
This paper reports on the design and construction of a bio-event annotated corpus which was developed with a specific view to the acquisition of semantic frames from biomedical corpora. We describe the adopted annotation scheme and the annotation process, which is supported by a dedicated annotation tool. The annotated corpus contains 677 abstracts of biomedical research articles.
OPTIMAL PERFORMANCE-BASED SEISMIC DESIGN OF COMPOSITE BUILDING FRAMES WITH RC COLUMNS AND STEEL BEAMS
Composite RCS building frames integrate reinforced concrete columns with structural steel beams to provide an efficient solution for the design and construction of earthquake-resisting structures. In this paper, an optimization framework is developed for performance-based seismic design of planar RCS moment resisting frames. The objective functions are defined as minimizing the construction cos...